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Abstract 

In this paper we present an extremely simple structure solution method termed charge 
flipping. It works ab initio on high resolution x-ray diffraction data in the manner 
of Fourier recycling. The real space modification simply changes the sign of charge 
density below a threshold, while in reciprocal space the modification is the Fobs map 
without any weighting. We test the algorithm using synthetic data for a wide range 
of structures, analyse the solution statistics and check the quality of reconstruction. 
Finally, we reconsider mathematical aspects of the algorithm in detail, showing that 
in this chaotic iteration process the solution is a limit cycle and not a fixed point. 

1 Introduction 

Ab initio structure solution by x-ray diffraction is a success story of the last century. Today 
the practicing crystallographer can rely on high quality data obtained on cryocooled crys- 
tals by area detectors and synchrotron radiation. In the structure solution process freely 
available software take the workload, a large part of accumulated knowledge is contained in 
their elaborate algorithms. The forefront is indisputably protein crystallography where the 
coordinates of more than a thousand atoms per asymmetric unit can be determined. The 
field of ab initio structure solution is mature, one has the impression that there is no room 
for big surprises. 

Nevertheless, in this paper we present an amazingly simple structure solution algorithm 
- termed charge flipping. This algorithm was inspired by several methods described in the 
literature: phase retrieval in optics [HEIISI, dual space programs SnB and ShelxD [41 E| and 
recent work on iterated projections [HI Ej. All these methods alternate between real and 
reciprocal space by the Fourier transform and do part of the job by imposing constraints on 
the real space charge density. In a sense this is a return to the era before direct methods 
but armed with orders of magnitude more computing power. The charge flipping algorithm 
described below is probably less efficient than state of the art programs today. However, it is 
surprising that it works at all, even in the small molecule world. Its extreme simplicity offers 
the prospect for exact mathematical formulation, raises hopes for further improvements and 
may help to understand the working and limitation of other methods. 



2 On the use of data and grid 



By ab initio structure solution we mean that there is no prehminary chemical or phase infor- 
mation and only a single wavelength diffraction data set is used in the process. Furthermore, 
anomalous scattering is not exploited, atomic scattering factors are taken to be strictly real. 
In this first presentation we use synthetic data and focus on what is feasible given ideal 
conditions. We assume that the diffraction data is complete up to a given resolution, it is 
error free, the absolute scale and global isotropic temperature factor are known. None of 
these conditions are required for the working of the algorithm, we tested the effect of Poisson 
noise, the error in absolute scale and temperature factor. However, only the ideal case is 
discussed here to keep the main message of the paper brief. 

The real space charge density and reciprocal space structure factors are related by the 
Discrete Fourier Transform which is a unitary mapping between the two spaces. For practice 
this is coded as variants of the high speed FFT algorithm. If structure factors are hmited 
by a given resolution then the charge density can be represented on a grid without loss of 
information. The necessary grid spacing is dgHd — dmm/'^ where K = 27r/(imm is the radius 
of resolution sphere in reciprocal space. Charge density pixels are often calculated on a 
finer grid so that contour maps look better. However, this only involves a larger region of 
unobserved structure factors in the calculation and does not provide more information. 

The importance of finite resolution is best shown with a plot. We generated the structure 
factors for a typical organic structure (example 1. of Table 1.). These are the true complex 
amplitudes -F(k) and not just their moduli i\3bs(k) which are used as observed data later. 
The resolution was = 0.8 A, structure factors outside the resolution sphere were 

treated as zeros. Then we calculated the real space charge density on a 0.4 A grid using 
the inverse FFT. Figure l.a shows the sorted charge pixels. The main characteristics is 
the small number of large positive values. Most pixels are concentrated around zero, it is 
exactly this real space property which allows structure solution. Small negative values are 
naturally present because all observed and unobserved structure factors would be needed to 
generate a truly positive charge density. Behind the following algorithm the simple thought 
is: positivity should be forced with care, small negative charge density may help the process 
of structure solution. 

3 The charge flipping algorithm 

The structure factor moduli -Fobs(k) are known for < \k\ = k < K, these are the observed 
data used by the algorithm. Unobserved moduli are treated as zeros throughout the iteration 
process, except F(0) which is initiahzed to zero but later let to change freely. The algorithm 
is initiated by selecting a random phase set {(/^(k)} which satisfies Friedel's law k) = 
— ^^(k). Structure factor amplitudes are created as FobsC*''' and an inverse FFT gives a real 
charge density p(r). This is our starting point in real space. 

Then one cycle of iteration goes from real space through reciprocal to real space again 
according to the following scheme: 
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F < G 

The charge density modification p(r) g{r) starts the current iteration cycle. It uses a 
positive threshold 6 for the charge pixels. The value of 5 is a fraction of a typical light atom 
peak and is the only parameter of the algorithm. As the height of atomic peaks depends on 
the grid size, data resolution and thermal parameter, these factors also affect the choice of 6. 
Pixels above this value are accepted unchanged on the assumption that they belong to atomic 
peaks. Pixels below 6 are simply multiplied by —1 which is made plausible later. In the 
next step temporary structure factors G(k) are calculated by an FFT. Then structure factors 
F(k) are constructed by accepting phases and replacing the moduli by Fobs(k). F{0) = G(0) 
is accepted as is without fixing or limiting its value, and -F(k) for k > K are reset to zero. 
Finally the -F(k) amplitudes are inverted to obtain the new approximation of the charge 
density p(r). This unconditional iteration process can continue without intervention, the 
traditional i?-factor or some other figure of merit serves only for monitoring and not as the 
objective function of an optimization approach. 

Symmetry is an important issue. Following positive experience jHl E], we handle all 
structures in the spacegroup PI and neglect any symmetry constraints. Accordingly, nothing 
fixes the origin which is an advantage, the structure can emerge anywhere. The disadvantage 
is that the charge density of the whole unit cell must be determined and not just that of the 
asymmetric unit. It turned out that the first factor is more important. When we applied the 
symmetry constraints of a given structure and thus forced the origin to a particular pixel of 
the unit cell, the success rate of the algorithm became much worse. 

The algorithm is local in both spaces, modification of charge pixels and structure factors 
occurs only in-place. In real space only the charge pixels below +6 are modified which can 
be further divided into two parts. Large negative values below —6 are flipped simply to 
force positivity. More interesting is the [—6, +6] range which is not negligible, it gives a 
substantial contribution to the structure factors. Figure l.b shows the sorted charge pixels 
of a typical solution. The [—6, +6] range is roughly linear which is an approximation of the 
target charge density. Flipping this region does not significantly change the distribution of 
pixels, but at the same time sufficiently explores the phase space. In reciprocal space the 
modification of structure factors corresponds to the unweighted Fobs map. The treatment of 
the unobserved F{0) is less standard. While its value equals the total charge and could come 
from the chemical composition, we do not make use of it, keeping the algorithm ab initio 
in the strict sense. F{0) is initialized to be zero and is let to change freely in the iteration 
cycles. In our studies this approach worked better than fixing the total charge. Note that 
in this simple scheme there is no reciprocal space weighting, no tangent formula and no use 
of probability. The concepts of atomicity and positivity are there but in a strange, indirect 
way. 

During prolonged tests of the algorithm we realized that it is closely related to the 
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solvent flipping method of Abrahams and Leslie ^21 used as density modification in 
protein crystallography. However, there are important differences. Solvent flipping - as 
other methods of density modification - is used for improving already existing phases and 
does not need atomic resolution data. In contrast, the charge flipping algorithm of this work 
is used ab initio and high resolution data is essential for its success. Solvent flipping requires 
the existence of separate solvent and protein regions. It modifies only the solvent charge 
density as p^^^ = Po + ^flip " (p ~ Po); where po is the expected solvent level and fcflip depends 
on the solvent content. In contrast, the charge flipping algorithm does not need separate 
real space regions, it is apphed everywhere. There is no choice of po or kfnp, the modification 
is always the sign change of pixels below the threshold parameter 6. The low density region 
occupies the space between atoms, which is automatically found and perpetually adjusted 
by the algorithm. 

In the following two sections we first give several examples of ab initio structure solution 
using charge flipping and then discuss mathematical aspects of the algorithm in detail. 

4 Structure solution examples 

We tested the charge flipping algorithm on more than 200 structures taken from the Cam- 
bridge Structural Database. For this presentation we selected ten examples in the simplest 
centrosymmetric and non-centrosymmetric spacegroups each with a considerable number of 
atoms. The structures are listed in Table 1. 

Table 1: Example structures. Columns: CSD code and original reference, spacegroup, 
number of non-hydrogen atoms and chemical formula per unit cell 





code and ref. 


spgr. 


N 


unit cell content 


1. 


feryoq |12j 


PI 


172 


2 ■ CsoiViOs 


2. 


rawtoy 


PI 


216 


2 ■ C88iV40i6 


3. 


ibeyap 


PI 


220 


2 • CggiViOia 


4. 


cotgib J3] 


PI 


244 


4 ■ 


5. 


sisyey [lEj 


PI 


326 


2 ■ CgsChMnuNiO^o 


6. 


valino [T7j 


PI 


156 


2 • C54iV60i8 


7. 


pawveo ^Hj 


PI 


164 


2 • C72N4OQ 


8. 


gofmod 


PI 


188 


2 • Ctj,^N40i2.5 


9. 


qarpuu [201 


PI 


220 


2 • Co^NM 


10. 


qibbuy pij 


PI 


240 


1 ■ Ci8iC/24-^6026-P3 



In all cases we generated data up to 0.8 A resolution using the coordinates and scattering 
factors of non-hydrogen atoms and without adding noise. Furthermore, we assumed that the 
absolute scale and the isotropic thermal parameter B are known. In practice these come from 
Wilson's plot. The knowledge of the absolute scale is not a serious issue, it is simply related 
to the proper choice of the 6 parameter. With trial and error we can quickly find its realistic 
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range and fine tuning is needed only for faster convergence. In our examples 15% accuracy 
of 5 is sufficient. Using B=0 needs more consideration. Thermal vibration smears out the 
atomic charge density and weakens atomicity on which the algorithm is based. Therefore, it 
is strongly preferred to use low temperature data. How well the effect of B can be removed 
from the real data is beyond the scope of this paper but our numerical tests show that an 
error of ~3A2 can be tolerated. 

All ten example structures were successfully solved using the charge flipping algorithm. 
The solution of each structure was attempted 100 times starting with different random 
phase sets and running the algorithm for a maximum of 5000 iteration cycles. The number 
of iterations leading to convergence greatly varies, only their distribution characterizes the 
difficulty of the problem. Solution statistics is compiled in Table 2. 

Table 2: Solution statistics of the example structures. Columns: S parameter in units of 
6 • Vpixei, success rate, mean/minimum/maximum number of iterations 





6 


success 


mean 


min. 


max. 


1. 


0.30 


0.99 


338 


55 


2005 


2. 


0.28 


1.00 


301 


55 


1650 


3. 


0.30 


1.00 


90 


30 


205 


4. 


0.37 


1.00 


101 


15 


230 


5. 


0.42 


1.00 


143 


70 


300 


6. 


0.28 


0.95 


1040 


115 


4220 


7. 


0.30 


1.00 


106 


35 


345 


8. 


0.30 


1.00 


268 


40 


1645 


9. 


0.42 


1.00 


198 


75 


690 


10. 


0.47 


1.00 


441 


85 


4115 



It is informative to follow some basic quantities during the iteration. Figure 2. shows a 
typical run of example 1. The three subplots arc: the total charge, the traditional i?- factor 
and the phase-change. In all three quantities a sudden decrease starts at 210 iterations and 
ends after another 10 iterations. This sharp drop is an unmistakable sign of convergence and 
its width is independent whether it occurs after 10 or 10000 iterations. All curves show three 
different parts: an initial transient, a long stagnation period before the convergence and an 
equilibrium after. What really goes on in these periods is discussed in the next section. 

Once a solution is found its quality must be evaluated. For this we locate the atoms by 
3x3x3 pixel peak picking and compare their number, centroid position and integrated 
weight to the original structure. The solutions are remarkably complete, all atoms of the 
original structure can be found. As we work in the spacegroup PI the structure is always 
shifted relative to the original and for non-centrosymmetric structures the solution is often 
the enantiomer. When we check a large number of solutions the shift vector is uniformly 
distributed in the unit cell. After applying the shift and enantiomer correction the coordi- 
nates of non-hydrogen atoms are typically within 0.1 A from the original structure and the 
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integrated weight of a carbon atom scatters between 4 to 6. This is considered very good 
quahty reconstruction, especially without the use of a separate refinement program. 

5 Mathematical notes on the algorithm 

Without giving a formal proof of convergence, we reconsider here some of the mathematical 
aspects of the iteration method described previously. 

Clearly, a prerequisite of any ab initio structure solution is that apart from translations 
and point group transformations, the Fourier moduli determine a unique density. Thus, 
we suppose uniqueness and mention only one obvious condition of it. The density should 
not be strictly positive, otherwise any sufficiently small change in the phases which respects 
Priedel's law would lead to a different, nonnegative density. 

Our method assumes that the density has extended regions of zeros. If the density 

Pideal(r) = ;^5]F(k)e-''"^ 
k 

has a sea of zeros then the value taken by the finite sum 

k<K 

is small positive or negative near this sea of ideal zeros. The discrete inverse Fourier transform 
provides a sampling of p at the centres of pixels and not an average over the volume of pixels. 
The oscillations around zero can be seen in the samphng, and the threshold 5 > under 
which the sign fiip is made has to be chosen in such a way that these small oscillations fall 
in the interval [—5, +5]. Therefore, the optimal choice of 5 depends on the function to be 
determined. We demonstrated earlier that 5 can be chosen without an a priori knowledge 
of p so that the algorithm converges, and for this reason 5 should not be too small. 
Given 5 > 0, we divide p in two parts, p = pi + P2 with 

n (r)-! ^^""^ ^(""^ - ^ (9) 

P^^""' " \ otherwise 

and 

P^^""' - \ otherwise ^'^^ 

Since the input data are Fobs(k) = |i^(k)| for < A; < iiT, the target function cannot be 
Pideai but only p, including the total charge 




6 



The iteration will generate F{0), although with a limited precision because it sensitively 
depends on the choice of S. If we are given Pideai(r) ~ "^-^(O), the charge F{0) can be found 
by knowing that the fiat regions of pideai have to be at zero level. No similar information 
about -F(O) can be used if we know only p(r) — yF{0). The best we can hope for is to 
reproduce pi whose mere definition depends on the positivity of 6. Another fundamental 
reason to work with 5 > is that for any set of phases {(y9(k)} and a constant c large enough 

0<A;<A' 

This means that any set of phases {<y9(k)} is a fixed point of the iteration if 5 = and F{0) 
is not prescribed. 

The scheme of iteration has been given in equation ([Q). More precisely, we do the 
following: 

0th half-cycle. 

We choose ip^^^k) for (the half of) k with < k < K independently, according to the 
uniform distribution in [0,27r]. Then 

F(o)rkl = / ^obs(k)e^^*°^« for 0<k<K , . 



^0 for k = and k > K. 

and by inverse FFT we compute p^*^-* determined in pixels r = r^. 
nth cycle (n > 1). 

Given p^"^^\ we divide it in two parts, 

p^-'^ = pt'^ + pt'^ (5) 
as in Q and (jSJ, and execute the sign flip on p("~^^(rj) < 5 to obtain 

9^ -Pi -P2 ■ (6) 

The Fourier transform of (^'•"^ provides G'''"^(k) for as many k as the number of pixels in the 
unit cell. Then 

r Fobs(k)G'(")(k)/|G(")(k)| =Fobs(k)e*^*"'« for 0<k<K 
F(")(k) = <^ ^^(O) for k = (7) 

( ioT k> K. 

and through inverse FFT we find the next approximation p'^")(r) of the density in pixels 
r = r^. 
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The real space transformation 

p{n-l) _^ g(n) 

(i) non-invertible, i.e. one cannot reproduce from g^'^'^ 

(ii) norm-preserving 

E^^"Hr,)^ = EP^""'nr,)^ (8) 
j j 

(iii) local in the sense that g^'"'\rj) depends only on p^''^~^'> [r j) . 

Together with (jHJ this implies \g^'^\vj)\ = \p^"'~^\vj)\. Its most important characteristics is, 
however 

(iv) sign change in a broad region of the unit cell, p^'^~^\rj) < 5 occurs for the majority 
of pixels. 

Locality in real space also means no charge displacement, implying that the position of 
the density evolves freely. This has consequences on the transformation we do in reciprocal 
space. Since the real space modification makes no use of symmetries, symmetry constraints 
on the phases of the Fourier components are not helpful or counter-productive. As we see 
from equation ((7j), the transformation G^^^ — * F^^^ in reciprocal space is also local and 
non-invertible but does not preserve the norm 

5^|F(")(k)|VEl^^"Hk)P (9) 

k k 

until convergence has not reached. After convergence, instead of © equality will hold, but 
not term by term. The step from g^^^ to G'-"^ always creates nonzero Fourier components 
for k > K, while F^^\'k) = for k > K. This means that 

J2 |G(")(k)p< |FW(k)p= J2 FoM" (10) 

0<k<K 0<k<K 0<k<K 

and therefore 

|G'W(k)|<|F(")(k)| = Fob.(k) (11) 

for the largest structure factors, dominating the sum of the squares. 

We emphasize that in the present algorithm convergence means reaching a limit cycle and 
not a fixed point, in the sense that |G'(")(k)| becomes independent of n, but <^'^")(k) alternates 
between two values according to the parity of n. In Figure 3. we plotted the evolution of 
(^k^ in the complex plane for a few strong reflections. This is more spectacular than the 
evolution of F^")(k) which stays on the circle of radius Fobs(k). Equation © imphes 

p{n-l) ^ p{n-l) ^ p{n-l) ^^2) 

where F^''" is the Fourier transform of p-" ^\ Thus, 

According to (|7j), F*^") is a functional of F^^ — f!^ ^\ but not of F'^^~^'> which is not 
uniquely determined by F^^ — ^\ When convergence sets in F/"^(k) becomes nearly 
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independent of ra, while F2 (k) alternates between two nearly coUinear vectors in the com- 
plex plane which are nearly orthogonal to their respective fI^\\<.). That collinearity and 
orthogonality are imperfect is due to (fTT|) and (fT^ . As a result, we can see an even-odd 
alternation of G*^"^(k) inside the circle of radius -Fobs(k), as shown in Figure 4. 

If the number of independent phases is A^, the algorithm has to find one of the good 
phase sets in the A^-dimensional real space M^, where good means reproducing a translate 
of pi. Any good set is represented by a point in M^, and these points form a three- dimen- 
sional manifold S having one or several connected components. If p(r) 7^ p(— r) then their 
respective translates generate different connected sets. $ = {(y9(k)} and \l/ = {■?/'(k)} are in 
the same component if they are connected by a space translation, 

^(k) = (p(k) + k ■ a 

for some a G M^. Phase retrieval is done in the cube C = [0,27r]^, and hence parts of 
S outside this cube have to be shifted back into it by subtracting integer multiples of 27r. 
Then even the connected components of S fall into three dimensional 'filaments' starting 
and ending on the surface of the cube, and our algorithm has to converge to a point of one 
of the filaments. Although S is infinite, when shifted back into C, the filaments do not fill 
densely the cube, otherwise any choice of the phases would do. Other than a circumstantial 
evidence of this fact can be obtained by noting that for any four vectors ki, . . . , k4 and any 
a G the numbers kj ■ a are rationally dependent, i.e. ^2^=1 '^i^i ■ a = for suitably 
chosen integers mj. Also, there is a large number of shift-invariant quantities formed by the 
coordinates of points of S", namely, if k' = mk for some integer m then 

(^(k') — mip{\i) 

is shift-invariant. 

Having no a priori information about the position of the filaments, a random initial set 
of phases seems to be a good choice to start with. This is even more so, because starting with 
a good phase set the iteration leaves the neighbourhood of this point and returns to another 
one after convergence. Indeed, starting with the good phases implies p'^°^ = p — J p dr. 
Then in real space, instead of shifting this function upwards, we start to flip the values 
below 5 and the iteration leads farther away from p before it approaches again a translate 
of it. That it does, is due to property (iv) of the real space transformation which invokes 
a wide exploration of C . We note that there is no attraction along the filaments, because 
the algorithm makes no preference in the position of the sample. It takes a while to reach 
the basin of attraction of one of the filaments, showing that their complement in C has 
to be a large set of a complicated structure, similar to a 'strange repellent' of a chaotic 
system. However, in the neighbourhood of a solution convergence sets in like an avalanche, 
it is complete after a few steps of iteration. The iteration process is chaotic, in the usual 
sense of sensitive dependence on initial conditions, which are the starting phases in our case. 
An obvious sign is that the number of iteration cycles leading to convergence varies wildly 
with the slightest change of a given random phase set. Chaotic behaviour comes from two 
sources. The topological reason is the filamentation of the set of points to be attained in 
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the very high dimensional cube of phases. The algorithmic reason is that successive points 
to be visited in C are decided by charge flipping in real space. Although this is done in 
a deterministic way, during the long stagnation period it appears very much like random 
jumps in the space of phases. 

It is instructive to follow the evolution of the total charge F'^"')(0) plotted in Figure 2. a. 
It starts with zero, in the first step of the iteration jumps to a high positive value, which is 
followed by a rapid decay to an intermediate value, a long stagnation, and a second abrupt 
drop to the final number. Its evolution follows rather closely that of the i?-factor, 

^(„) ^ Eo<.<Kl|g^"Hk)|-F.,.(k)| ^^^^ 

^0<k<K -^obs(k) 

shown in Figure 2.b. The distance 

^(n)) ^ Eo<.,.i^obs(k)|M-+^)(k)-^W(k))(mod2vr)| ^^^^ 

exhibits a similar behaviour (Figure 2.c). We recall that the i?-factor is used only for 
monitoring the convergence, the fact that it does not tend to zero is due to and has no 
bearing on the success of the iteration. 

The initial overshooting of F^^\Qi) for most of the starting phase sets can be understood 
as follows. Since 

GW(0) = F(°)(0) =0 

and 

Ff\Q)= [pf'>dr = - /pfdr>0 



is the charge carried by the pixels in which p^^^rj) > S, we have 

G(i)(0) = F«(0) = 2Fi^°^(0) . 
No similar charge doubling occurs in the subsequent steps, because typically 

p^dr > - / p^")dr 



for 77, > 0. For the example in Figure 2. the actual limit reached by ^'^"^•'(0) is about 30% 
lower than the true value of F{0). In principle, a smaller 6 could yield the correct value, 
but in practice we would not obtain convergence with a smaller 6. After convergence sets in, 
6 can be decreased and the iteration continued without destroying the result. A new limit 
cycle will be attained with suppressed oscillations and a higher total charge. This can go on 
down to 5 = 0, where the sea of zeros will be lost and the total charge will become too large. 
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6 Conclusion 



In this paper we presented an ab initio structure solution method termed charge flipping. It 
uses high resolution data and alternates between real and reciprocal space in the manner of 
Fourier recycling. The real space modification simply changes the sign of the charge density 
below a threshold 5, which is the only parameter of the algorithm. In reciprocal space, 
observed moduli are constrained using the unweighted Fobs map while F{0) corresponding 
to the total charge is allowed to change freely. All structures were handled in spacegroup PI 
neglecting any symmetry constraints. 

We tested the algorithm using 0.8 A resolution synthetic data for a wide range of cen- 
trosymmctric and non-centrosymmetric structures taken from the Cambridge Structural 
Database. The size of our examples is somewhere at the upper limit of small molecule 
structures in spacegroups PI and PI. While ideal data are not required for the working 
of the method, at this stage of development we needed a large pool of various structures 
and we had to be sure that any difficulty of solution is not the consequence of data quality, 
resolution or completeness. For the examples presented we followed single runs, analysed 
solution statistics of multiple runs and checked the quality of reconstruction. Here we only 
emphasize that all structures were solved with a high success rate, and all atoms were found 
without the use of a separate refinement program. 

Finally, mathematical aspects of the iteration process were considered in detail. We clar- 
ified the most important properties of real and reciprocal space transformations, discussed 
the relevance of finite resolution and the choice of 6. By following the evolution of structure 
factors we also showed that the iteration process is chaotic and the solution is not a fixed 
point but a limit cycle. 

The most important characteristics of the algorithm is its amazing simplicity. This is 
a big advantage for exact mathematical treatment, in the future we shall attempt to give 
a formal proof of convergence. When it comes to efficiency, such a simple method is likely 
to lag behind today's best programs. As long as symmetry is not helpful, charge fiipping 
offers only an interesting alternative in the low symmetry spacegroups. Nevertheless, we 
encourage everyone to try it, it is a few lines of code plugged in an existing program, and 
only a few hundred lines as a stand alone application. We shall also continue our work to 
check the power of the algorithm on real data, and to make a fair comparison of success 
ratios to other methods. We have well defined plans to improve the algorithm but anticipate 
that the original simplicity will be lost in exchange for higher efficiency. 

We thank Gyula Faigcl, Miklos Tcgzc and Gabor Bortel for useful discussions. This 
research was supported by OTKA grants T043494 and T042914 and the work of G.O. was 
also funded by a Bolyai Janos Scholarship. 
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Figure 1: Charge density of a typical organic structure at 0.8 A resolution, a: target charge 
density, b: solution after convergence. Pixel values are sorted in ascending order and are 
normalized to the maximum of the target. Note that the [—6, +6] range within horizontal 
lines is approximately linear. 
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Figure 2: A typical run of the charge flipping algorithm leading to convergence. From top 
to bottom: total charge, i?-factor and phase-change as a function of the iteration cycle. The 
total charge is normalized by its ideal finite resolution value. The i?-factor and phase-change 
are defined by equations (fT^ and (fT^ . 
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Figure 3: The evolution of the structure factor amphtude ^^"^(k) in the complex plane for 
a few strong reflections. In each case it alternates between two values after convergence, 
showing that the solution is a limit cycle. 
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Figure 4: The evolution of a selected structure factor amplitude G*^")(k) in the complex 
plane. Arrows explain one cycle of iteration from G^"^ to G^^^^) through F^'^^—F^^^ + ^3"^ 
The radius of the circle is Fobs(k) . 
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